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Abstract — Cognitive radios process their sensed information 
collectively in order to opportunistically identify and access 
under-utilized spectrum segments (spectrum holes). Due to the 
transient and rapidly-varying nature of the spectrum occupancy, 
the cognitive radios (secondary users) must be agile in identifying 
the spectrum holes in order to enhance their spectral efficiency. 
We propose a novel adaptive procedure to reinforce the agility 
of the secondary users for identifying multiple spectrum holes 
simultaneously over a wide spectrum band. This is accomplished 
by successively exploring the set of potential spectrum holes 
and progressively allocating the sensing resources to the most 
promising areas of the spectrum. Such exploration and resource 
allocation results in conservative spending of the sensing re- 
sources and translates into very agile spectrum monitoring. The 
proposed successive and adaptive sensing procedure is in contrast 
to the more conventional approaches that distribute the sampling 
resources equally over the entire spectrum. Besides improved 
agility, the adaptive procedure requires less-stringent constraints 
on the power of the primary users to guarantee that they remain 
distinguishable from the environment noise and renders more 
reliable spectrum hole detection. 

Index Terms — Adaptive, agility, sparsity, spectrum sensing, 
wideband 



I. Introduction 

A. Background 

The notion of cognitive radio has emerged as a mean to 
alleviate the scarcity of available frequency spectrum through 
accommodating unlicensed (secondary) users within the under- 
utilized spectrum bands licensed to the legacy (primary) users. 
For this purpose, secondary users monitor the spectrum in 
order to identify and exploit the under-utilized segments. Such 
coexistence of the primary and secondary users is constrained 
by the necessary warranties on the quality of service provided 
for the primary users. 

For detecting an under-utilized segment in the wideband 
spectrum, which we hereinafter call a spectrum hole, sec- 
ondary users face two major challenges. First, the spectrum 
holes are spread across the wide band and often their occu- 
pancy status changes rapidly, i.e., the holes might not remain 
unoccupied for a long time. Therefore, the amount of time 
spent searching for a spectrum hole could potentially constitute 
a considerable fraction of the time that a spectrum hole 
remains vacant. Such rapid changes are governed mainly by 
the spectral activities of the active (primary and other potential 
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secondary) users seeking access to the same spectrum. There- 
fore, the secondary users should be agile enough to detect 
a spectrum hole in substantially less time than the period in 
which the hole remains vacant. Agility becomes even more 
prominent in wideband systems where the secondary users 
have to monitor a larger number of channels with a sparse 
distribution of spectrum holes. The current statistics about 
the spectrum occupancy patterns provide that a considerable 
portion of the spectrum is under-utilized, hence promoting 
the notion of cognitive communication. Nevertheless, by envi- 
sioning the futuristic scenario of cognitive networks, enabling 
cognitive communication allows many unlicensed users to 
compete for the same spectrum resources. Under this envi- 
sioned scenario and by taking into account the ever-increasing 
demand for data communication, the vacant spectrum bands 
will not be as abundant as they are, which makes the sparsity 
assumption quite reasonable. We would like to also remark that 
the same futuristic sparsity assumption on spectrum vacancy 
is also envisioned in JT|, 0. The second challenge is to avoid 
harming the communication of the primary users. For this 
purpose the secondary users must reliably distinguish the holes 
from the spectrum segments accessed by the primary users, 
regardless of how weak the primary users are. 

Sensing the spectrum, being a major task of the secondary 
users, has received a considerable amount of research interest, 
see, e.g., 0, 0, 0, 0, 0, 0. We review some of the 
related works in the following section. 

B. Related Works 

There exist, broadly, three approaches for identifying spec- 
trum vacancies within a wideband spectrum. A notable ap- 
proach on agile spectrum sensing in a wideband spectrum is 
the quickest sequential search scheme of [7|. In this scheme 
the spectrum is split into smaller narrowband channels and 
the secondary user senses one channel at a time. After ac- 
cumulating enough information about each channel it decides 
whether the channel is a hole or is occupied. If the channel is 
decided to be a hole, the search is terminated and otherwise 
the process is carried on until a hole is spotted. As the 
spectrum holes become less frequent (sparser) enhancing the 
agility of the secondary users becomes more significant It 
is noteworthy that when the spectrum holes are not sparse, 
the quickest detection approach Q is very effective. In such 
scenarios the expected density of the spectrum holes is high 
and it becomes very likely to encounter a spectrum hole after 
sensing a few channels. Therefore, by deploying the sequential 
quickest detection approach the expected amount of sampling 
resources required for identifying multiple holes is small and 
thereof the overall detection process will be very agile. For 
a sparse distribution of spectrum holes, however, sensing the 
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channels sequentially for identification of multiple holes will 
substantially lengthen the expected sensing time. Therefore, 
the objective of our proposed adaptive procedure is to address 
the issue of agility for a sparse distribution of spectrum holes 
and when we are interested in identifying multiple holes, 
as it is more challenging and realistic in practice. We also 
remark on the recent results on sequential binary hypothesis 
testing of multiple sequences |9| where a sparse number of the 
sequences are generated according to the null distribution and 
the majority of them according to an alternative distribution. 
It develops a sequential thresholding-based test for identifying 
almost all the sequences generated according to the null 
distribution. These tests tend to identify almost all of the 
sequences generated by the null distribution. 

In another approach it is assumed that the wideband channel 
is heavily under-utilized and it is used only rarely and sparsely. 
In this approach the cognitive radios exploit the sparsity 
structure of the wideband channel and construct compressed 
sensing-based machineries for estimating the power spectral 
density (PSD) of the wideband channel ITUl. ifTTl. fl2l. |[T3l 
El, 03], [16|. These approaches are further extended to also 
track temporal variations of the spectral occupancy during 
sensing 1171 . Ifl8l . Exploiting the sparsity empowers the cog- 
nitive radios to sample the signal activity over the channel at 
a sub-Nyquist rate, which expedites the process of estimating 
the PSD. While sampling at a sub-Nyquist rate substantially 
reduces the number of samples required for reconstructing the 
PSD, the number of samples required yet scales linearly with 
the bandwidth of signals of the active users. Therefore, the 
amount of required samples for reconstructing the PSD of 
large-bandwidth signals (e.g., video signals) is substantially 
large. When each of the channels is narrowband, however, 
estimating the PSD encompasses much redundancy as the 
ultimate objective of the cognitive user is to make a 1-bit 
decision about a channel (hole vs. occupied). By taking this 
fact into account, our proposed sensing procedure further 
reduces the sampling resources below the level required by 
the compressed-sensing-based methods. 

In another approach tailored for cognitive networks a group 
of cognitive radios are clustered to collaboratively perform 
spectrum sensing 0], 0D> H3, HD, E0). At the cost of 
extra communication among the cognitive radios, they can 
share their individual perception about the spectral occupancy 
and reach a more reliable decision through exchanging their 
individual perceptions (coordination). Such information ex- 
change is viable in cognitive networks with a fusion center or 
an established infra-structure through which the autonomous 
users can be coordinated. In this paper we consider ad-hoc 
cognitive networks, in which each cognitive user operates 
autonomously and independently of the rest of users. In such a 
scenario, the task of reliable spectrum sensing must be carried 
out by each user independently. It is noteworthy that our 
approach can also be integrated with the existing collaborative 
sensing methods. 

C. Contribution 

In this paper we propose a novel adaptive spectrum sensing 
procedure for ad-hoc cognitive networks. The underlying 



premise is that one can adaptively decide how to spend a given 
sampling budget in the course of the spectrum sensing process, 
focusing the sensing resources on more promising segments of 
the spectrum. This is in contrast to the conventional spectrum 
sensing schemes with a pre-defined sensing strategy. Our pro- 
posed adaptive procedure consists of two phases, namely the 
exploration and detection phases. The errand of the exploration 
phase is to swiftly eliminate a substantial portion of the 
occupied channels and at the same time retain most of the 
holes. Exploration is carried out through an iterative process, 
where in each iteration a group of less promising segments of 
the spectrum are identified and eliminated. The fundamental 
basis of the exploration phase is that it can be done accurately 
even if the sensed information (measurements) is very rough 
and noisy. During exploration, each iteration further monitors 
the spectrum and improves on the outcome of the previous 
iteration. The exploration phase is subsequently followed by a 
detection phase for identifying multiple spectrum holes among 
the reduced number of channels retained. 

The analysis reveals that, if we perform multiple cycles 
of the proposed explorations, for achieving a certain level 
of reliability, the ratio of the sampling budget required by 
the proposed adaptive procedure to that required by the non- 
adaptive procedure is approximately j^, where M is the 
sampling budget per channel in the non-adaptive procedure. 
Furthermore, the adaptive procedure requires less-stringent 
constraints on the power of the active users to guarantee a 
successful hole detection. In particular, let e n be the probability 
that each channel is a spectrum hole. Then the requirement for 
identifying spectrum holes by the non-adaptive procedure is 
that the power of the active users must scaleQ as u) ( r ^ff-\ > 
where M is the sampling budget per channel. By deploying the 
adaptive procedure this requirement reduces to approximately!! 
U) ^ M \[^~) ' wnere M 1 is an integer substantially larger than 
M. 

The exploration phase and the idea of successively searching 
for the spectrum holes is inspired by the notion of distilled 
sensing developed in Ell . [22) , 0231 . In distilled sensing it is 
shown that given a fixed sampling budget certain signals that 
are detectable/estimable using adaptive measurements cannot 
be recovered using non-adaptive strategies. The results show 
that closing the loop between the data analysis and collection 
processes can yield very significant gains. Despite the sim- 
ilarities between our exploration phase and distilled sensing 
in ETI . E2l . E3l . there exist also significant differences 
mainly in two aspects: 

1) We desire to identify any arbitrary number (and not 
necessarily all) of the spectrum holes where distilled 
sensing aims to identify all data points that bear a certain 
sparsity structure. 

2) The observation models are different. The works in 
Ell . E2ll . E3l considers a Gaussian observation model 

'For convenience we will use the following asymptotic notations: for 
two sequences a n > and b n > we say that a n = ui(b n ) if 
liminfn-^oo j^ 1 - = 00 and a n = o(b n ) if limsupj^^ = 0. Also 
ctn = b n indicates asymptotic equality of a n and b n , i.e., lim^—n^o ^ = 1. 
The precise characterization is given in Section |IV] 



3 



and the theoretical guarantees depend on the Gaussian 
assumption. On the other hand, in the spectrum sensing 
problem in hand we deal with data drawn from Gamma 
distributions. The analysis for the Gamma distribution 
and Gaussian distributions are significantly different. 
This is due to the fact that the experimental design for 
distilled sensing is primarily shaped up by the statistical 
model of the data samples, which in turn necessitates 
independent analysis for each class of distributions. 
The remainder of this paper is organized as follows. In 
Section [II] the system model and the formal statement of 
the problem are provided. Section Hn] introduces the non- 
adaptive hole detection scheme, which serves as a baseline for 
assessing the performance of the proposed adaptive procedure. 
The non-adaptive hole detection scheme also represents the 
structure of the detection phase that we deploy in the adaptive 
procedure. In Section [IV] we describe the proposed adaptive 
hole detection procedure and its related analysis. Simulation 
results are provided in Section [V] and Section |VI] provides 
some concluding remarks. 

II. System Descriptions and Problem Statement 
A. System Setup 

Consider a wideband spectrum to be shared by license- 
holding (primary) users and cognitive (secondary) users with 
interference-avoiding spectrum access. The primary users are 
formally allocated a number of channels in the spectrum 
and have the right to instantaneously access the channels 
whenever desired. The secondary users, on the other hand, 
are allowed to opportunistically seek the portions of the 
spectrum under-utilized by the active users (either primary 
users or secondary ones currently using the spectrum) and 
access them. Our objective is to provide an agile and reliable 
mechanism for identifying such communication opportunities 
for the secondary users. 

We assume that the available spectrum consists of n non- 
overlapping channels, indexed by {l,...,n}. At a given 
instance, the active users communicate over some of these 
channels, which we refer to as occupied channels, and under- 
utilize the rest, which we call spectrum holes. We assume that 
the occupancy status of the spectrum remains unchanged dur- 
ing spectrum sensing process and and consider a probabilistic 
model for the occupancy of the spectrum. Let the Bernoulli 
random variable Zi, for i £ {1, . . . , n}, indicate the occupancy 
state of the i th channel, where Zi = 1 means that the i th 
channel is occupied and Zi = states that the i th channel is a 
spectrum hole. We assume that the channels have statistically 
independent occupancy states and each channel is a spectrum 
hole with probability e n . Therefore, 

Z t ~ Bernoulli^ - e„) . (1) 

Let us also denote the set of the indices of the spectrum holes 
and the occupied channels by 

U = {i€{l,...,n}:Z i = 0} , (2) 
U x = {i€ {!,... ,n} :Zi = 1} . (3) 



In order to capitalize on the availability of spectrum holes, 
the secondary users monitor the spectrum via channel mea- 
surements. Each measurement of channel i, denoted by Xi, is 
of the form 

Xi = ^piHi-Si-Zi + Wi, for ie{l,...,n}, (4) 

where Pi accounts for the combined effect of the transmission 
power of the user active on the i th channel and its associated 
signal attenuation (for i £ Hi), Hi denotes the fiat-fading 
channel from the active user to the secondary user and 
is distributed as zero-mean unit-variance complex Gaussian. 
Also, Si denotes the normalized unit-power signal of the active 
user active on the i th channel, for i 6 Hi- Finally Wi denotes 
the additive white Gaussian channel noise distributed as zero- 
mean unit-variance complex Gaussian. The measurements are 
statistically independent over time and channels. 

Finally, we assume that the powers of the active users sensed 
by the secondary user are lower-bounded by 7„ for some 
arbitrary j n > 0, i.e., pi > j n for all i e Hi- Acquiring 
7„ is feasible for secondary users if the geographical extent 
(and subsequently the maximum path-loss) and the range of 
the transmission power of the active users are known to them. 
Clearly the likelihood of channel vacancy e„ and the lower 
bound 7„ influence the precision of successfully detecting 
a hole; increasing j n and e„ enhances the reliability of the 
spectrum sensing process. 

B. Spectral Monitoring Goal 

The main objective is to use measurements of form (|4|l in 
order to identify T € N spectrum holes, i.e., T element of Hq. 
The conventional non-adaptive spectrum sensing procedures 
perform some pre-specified measurements of the spectrum 
and possibly locate T spectrum holes based on the informa- 
tion extracted from the measurements. This strategy is non- 
adaptive, in the sense that the measurement process is fixed a 
priori and does not change during the experiment. In contrast, 
we devise an adaptive procedure such that the measurement 
strategy is adjusted sequentially such that future measurements 
use information gathered from previous ones. We demonstrate 
that such measurement adaptation substantially improves the 
reliability and agility of the secondary users in detecting 
spectrum holes. 

We assess the agility by quantifying the amount of time 
required for sensing the channel. Agility depends linearly 
on the aggregate amount of measurements (sampling budget) 
made over the entire wideband spectrum, assuming that the 
measurements take the same amount of time. Therefore, 
increasing the sampling budget translates into more delay 
(or less agility) in detection. On the other hand, increased 
sampling budget improves the detection reliability. Therefore, 
for detecting multiple spectrum holes there exists a tradeoff 
between agility and reliability. Incorporating the impact of 
channel occupancy likelihood (characterized by e„) and the 
power of active users establishes that in both adaptive and non- 
adaptive procedures, there exists an inherent interplay among 
the agility, reliability, channel occupancy likelihood, and the 
power of active users. 
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We characterize such interplays in both non-adaptive and 
the proposed adaptive sensing procedures. Comparing these 
interplays subsequently demonstrates the agility and reliability 
gains afforded by the adaptive procedure. As we consider 
wideband channels, the analysis provided are asymptotic with 
respect to large number of channels, n. 

III. Non- Adaptive Spectrum Detection 

In this section we analyze an optimal non-adaptive detection 
scheme for locating T spectrum holes. The development 
of this scheme and the ensuing analysis serve a two-fold 
purpose. On one hand this detection scheme is also deployed 
in the detection phase of the adaptive procedure proposed in 
Section[IV] On the other hand, it offers a baseline for assessing 
the gain yielded by the adaptive procedure. 

A. Non-Adaptive Sensing Procedure 

Constructing a non-adaptive spectrum sensing procedure 
involves two issues. The first one pertains to the experimen- 
tal design, which is the design of the information-gathering 
process. In our setup the experimental design elucidates the 
distribution of the sampling budget among the channels across 
the wideband spectrum. The second issue is to design a 
detector based on some optimality criterion. The decision on 
the experimental design hinges on the expected proportion of 
spectrum holes, i.e., e n . Prior to the sensing procedure, there 
is no extra side information about spectral activity in any of 
the channels and all the channels are equally likely to be 
spectrum holes. Due to the inherent symmetry and the sparse 
distribution of the spectrum holes (e„ € o(l)), we assume that 
the experimental design measures all channels equally (note 
that if e„ is large, this is not the best strategy). 

Given this experimental design, it is straightforward to con- 
struct a detector that is optimal in the sense that it maximizes 
the a posteriori probability of successfully detecting T spec- 
trum holes, i.e., the maximum a posteriori (MAP) detector. 
Suppose that the measurement budget is B G N, meaning 
that we make a total of B measurements of the form (g). 
Therefore, the experimental design obtains M — [B/n\ 
measurements of the form (|4) per channel. Define Xi(j) 
as the j th measurement of the i th channel. Recalling the 
assumption that the occupancy statuses do not change over the 
course of sensing, the terms {p^ and {Z^} remain constant 
for all j 6 {1, . . . , M}, while the fading coefficient Hi(j), 
the noise term Wi(j), and the transmitted symbols Si(j) 
might change from one observation to another. Defining the 
vectors X, = [X t {j)] 3 , H, = [H^j)^, S z = [S z (j)] 3 , and 
W{ = [Wi(j)]j, based on the observation set is given by 

V n = {Xi : Xi = y/pl Zi HioSi + Wi for i = 1, 



(5) 

where "o" denotes the Hadamard product. From (@) it follows 
that given the occupancy status Zi, the observation sample 
Xi(j) has a complex Gaussian distribution with mean zero 
and variance (1 + PiZi), i.e., 

Xi(j) | Z 2 ~ A/" c (0, 1 +PiZi), for j £ {!,..., M} . (6) 



According to (Q}, Z t has a Bernoulli distribution with mean 
1 — e„. Therefore the measurement vector Xi is a sample from 
a mixture distribution with the probability density function 



- M exp(-||X s: 



(1 



(7T(l+ Pi )) M 



exp 



1+Pi 



(7) 



Given the observation set T> n , the maximum a posteriori 
(MAP) criterion for identifying T spectrum holes entails 
finding the subset of channels with cardinality T that has the 
highest probability of containing only holes and is formalized 
in the next remark. 

Remark 1: The MAP rule for detecting T holes is given by 



ZCp = arg max P{U C H Q \ V n ) 

U: \U\=T 

= are' min > Ui , 



where 



(l+ Pl ) M 



exp 



l+Pi 



(8) 



(9) 



Proof: See Appendix . ■ 
Due to the dependence of {Ui} on {pi}, the posterior proba- 
bility heavily depends on the power of the active users {pi}. 
The uncertainties about {pi} translates into uncertainty about 
any performance measure of interest, which in turn prevents us 
from obtaining a globally (over all power realizations) optimal 
hole detector. As a remedy, by alternatively exploiting the 
notion of robustness in the worst-case we can offer some 
worst-case guarantees. In particular, we look at the realizations 
of {pi} which yield the worst-case performance for detecting 
T holes. For this purpose, define ^(jn) as the class of active 
users satisfying the power constraint pi > j n for all i E H\. 
Then, the worst-case detection performance in the ^'(7^) class 
of active users is given by 



, min P(U N U 



Note that for i 6 Hi, the i th channel becomes less- 
distinguishable from a spectrum hole as pi decreases. There- 
fore, due to the independence among the distinct channels, 
the weakest detection performance, which occurs when the 
non-holes are least-distinguishable from the holes, corresponds 
to the smallest possible choices of {pi}, i.e., pi = j n 
for i € Hi- For this setting, finding the indices of the T 
smallest elements of the set {Ui, . . . , [/„} becomes equivalent 
to finding the indices of the T smallest elements of the 
set {||Xi|| 2 , . . . , ||X„|| 2 }. Therefore, in order to locate T 
spectrum holes the MAP detector requires only the sufficient 
statistics Yi = \\Xi\\ 2 for i = 1, ...,n. Corresponding to 
the sequence of random variables {Yi, . . . ,Y n } we define 
{Y(i), . . . , as the sequence of order statistics in an in- 

creasing order, e.g., F( m ) represents the m th smallest element 
of {Yi,...,Y n }. Based on this definition, the robust hole 
detector (with the non-adaptive procedure) can be formalized 
as follows. 
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Remark 2 (Robust Spectrum Detector): In the 4 r (7 n ) class 
of active users when e„ = o(l), the robust non-adaptive 
spectrum detector for identifying T holes is given by 



U?: = {ie{l,...,n}:Y i <Y [T) } 



B. Asymptotic Performance 

Recalling the distribution of Xi(j) given in ([§}, the suffi- 
cient statistics Yi is distributed as 

Yi | Zi ~ Gamma (M, 1 + piZi) for i = 1, . . . ,n , 

where Gamma (a, 6) denotes a Gamma distributiorH with 
parameters a and b. For assessing the performance of the 
robust detection scheme, for all i £ Hi we set pi = -f n . 
Clearly the robust detector will make a detection error if 
^Cb D^i 7^ Let us define ut and v t as the indices of the 
t th smallest elements of the sets {Yi} and {Y ■ i £ "Ho}, 
respectively. From Remark |2] the worst-case detection error 
probability is given by 



x min pfQ^nHijt 

{p,}c*( 7 „) V 



= 1- max P Iffn«i = 

{p,}C*( 7 „) V 

= 1— max f({lti, • • • ,Ut} C Kn) , 
ta}c*( 7 „) 

Note that having all the T smallest measured energies belong- 
ing to the holes is equivalent to not having the energies of all 
non-holes greater than the T smallest measured energies, i.e., 

{{ui, . . .,u T } 6 Ha} 

{Y Vt <Yi , ViG{l,...,T} and Vie Hi} 

max Y v . < min Yi 
te{l,...,T} ieUi 



Y v „ < min K 



Therefore, 



PnaW = 1 - max P(Y VT < min Fj) , (11) 

Assessing P NA (n) as defined above relies on the properties 
of the order statistics of a set of random variables. The 
following lemma is instrumental for characterizing the dis- 
tributions of the order statistics and evaluating P NA (n). This 
is a generalization of a well-studied problem in the context 
of extreme value theory that considers the first order statistic 
[24 1 for Gamma distributions. In this lemma, we give the 
corresponding results when distribution evolution is allowed, 
i.e., the number and distribution of the involved random 
variables changes simultaneously, as well as the analysis for 
higher order statistics, for which the existing results are not 
applicable. 

Lemma 1: Let {^}™ i be a sequence of i.i.d. random 
variables distributed as Gamma (M, a m ) and denote its cor- 
responding sequence of order statistics by {Yifj}^!^. Let 

3 The random variable G with distribution Gamma (a, b), where a, b > 0, 
has a density with respect to the Lebesgue measure of the form fa(t) = 



a-l exp(-i/b) 



l{t > 0}. 



h — 



r(M+i) 



and for some T € N define the 



sequence of random variables W, 



Then as m 



£^ for i 



1,...,T. 



WVX converges in distribution to a random 

(0 & 



(10) variable Wu) with cumulative density function (CDF) 



= P(W (i ) <w) = l- cxp(-w M ) 



*-! a „kM 



W 



fc=0 



A:! 



Proof: See Appendix lAl ■ 
For the setting of Section IH-AI the following theorem 
characterizes the asymptotic performance of the robust hole 
detector in the ^( r y n ) class of active users. It also establishes 
the tradeoffs among the spectrum vacancy likelihood e n , per 
channel sampling budget M, and the minimum power of active 
users. 

Theorem 1 (Non-Adaptive Tradeoff): For the ^(jn) class 
of active users, when e„ = o(l) and ne n = uj(1), the error 
probability of the robust detector for identifying T spectrum 
holes is given by 



min p(ult^H^ 

{ Pi }c*( 7 „) V 



l-(l + [(l + 7n ) 



M 



(12) 



Proof: For assessing the performance of the robust detec- 
tion scheme, we set = j n for alH G Hi, as this corresponds 
to the worst case scenario. The number of holes denoted by 
\Hq\, is a random quantity. We proceed by conditioning on 
{Zi}™ =1 in what follows, but for notational convenience we 
do not explicitly represent this dependence. Define no = \Ho\ 
and rii = \Hi\ = n ~ uq (conditionally on {Zi} these are 
constants). 

We are now ready to apply Lemma Q] on the two se- 
quences of i.i.d. random variables {Yi : i G Hq} dis- 
tributed as Gamma (M, 1) and {Yi : i G Hi} distributed as 
Gamma (M, l + j n ). Corresponding to these sequences define 



^0 A 



T(M + 1) 



l/M 



and 



Also define 
1 



r(M + i) 



l/M 



(13) 



(14) 



b° 



Y i: i G H a , and = —Y t , i G H\ . 

(15) 

For convenience denote the probability density functions 
(PDF) of Wfo by q i;0 {w;m) = £Q vfi ( w -ni) and PDF of 
b y Qi;i(w,ni) = -^Qi;i{w;m) (we drop the explicit 
dependence on n to avoid notational clutter). Note that {W 7 ^} 
and {W 7 ^} are statistically independent. In what follows we 
are going to be interested in the order statistics of these 
sequences, in particular W%y , i G {1, . . . , T}, the smallest T 
elements of {W^ }i & f{ sorted in increasing magnitude, and 
W, 



mm ie u 1 Wfiy v Taking this into account and noting 
that, by the law of large numbers, the assumptions e n — o(l) 



(i);i 
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and ne n = w(l) imply that no, n\ — > oo as n —> oo, we have 
according to Lemma Q] 



Q{i)-i{w\n 1 ) = 1 - exp(-w ' ) 



Q W; o(ti»;no)=P(W^. <u>) 



= 1 -cxp(~w M )J2 



i—l i~m 
W 



k=0 



k\ 



As a result, by invoking (fTTT i when no and ni approach infinity 



min P CnKi^D {Z*} 

{ Pi }C*( 7 „) 



is characterized in ([T6V We now remove the conditioning on 
{.Zi}, by computing the expectation of the expression above. 
Recall that no/ni is a random quantity, and by assumption 
and the law of large numbers no/ni — > e n . Using this fact 



and continuous mapping principles we conclude that 

min P I 

{ Pi }c*( 7 „) V 




= 1 - 

as desired. ■ 
As expected, there exists a tension between reliability and 
agility. On one hand, increasing the sampling budget per 
channel M favors reliability, as according to (fTZt it improves 
the probability of successfully detecting a hole. Increasing 
the sampling budget, on the other hand, imposes more delay 
in spectrum sensing as the time required for completing the 
acquisition of data scales linearly with the sampling budget. 
Finally, (TTZt also states that an increase in the minimum power 
of the active users 7„, improves the reliability. Theorem Q] 
characterizes the tradeoff among reliability, hole detection 
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agility, and the power of active users. By using the result 
of Theorem [T] we offer a necessary and sufficient condition 
on the scaling of the power of the active users to guarantee 
asymptotically error-free multiple hole detection in the non- 
adaptive sensing setting. 

Corollary 1 (Non-Adaptive Power Scaling): For the 4 r (7 n ) 
class of active users, when e n = o(l) and ne n = w(l), the 
necessary and sufficient condition for P NA (n) — > as n — > oo 
is that 7„ scales with increasing n as 

In other words, if the power of the "faintest" active user grows 
faster than ^\J~^f- , then a secondary user can reliably identify 
T holes by employing this non-adaptive procedure. The proof 
follows in a straightforward way from the characterization of 
P NA (n) given in ( fT2] i by noting that P NA (n) —> is equivalent 
to (1 + 7„) M • e„ -> oo. 

IV. Adaptive Spectrum Detection 
A. Adaptive Sensing Procedure 

Our proposed adaptive spectrum sensing procedure has 
two phases, namely the exploration phase and the detection 
phase. The exploration phase, being an iterative procedure, is 
intended to purify the set of the channels to be sensed carefully 
for detecting spectrum holes. This phase is accomplished by 
successively identifying and eliminating a group of channels 
deemed to be occupied. The detection phase is performed after 
the exploration phase in order to identify T holes among the 
subset of candidate channel retained after exploration. The 
detection scheme deployed is identical to the robust spectrum 
detection scheme of Section HTT1 

The exploration phase proceeds in an iterative way. In 
each iteration it further monitors the channels retained by the 
previous iteration and eliminates those deemed to be spectrum 
holes least-likely. The core idea is that it is relatively easy 
to identify occupied channels with low-quality measurements 
since there are few holes available (recall that e n is small). 
Each iteration carries on by thresholding the observed en- 
ergy on each channel retained by the previous iteration. The 
threshold level depends only on j n , and is designed such 
that at each iteration roughly half of the existing occupied 
channels are eliminated, while almost all the of spectrum 
holes are preserved. The output of each exploration phase 
will have a more condensed proportion of spectrum holes to 
occupied channels. Subsequently, the detector developed for 
the non-adaptive procedure is applied on this refined set of 
channels in order to identify T spectrum holes. This entire 
procedure bears similarities with Distilled Sensing [21], \22\, 
[23 1, however, the analysis is substantially different. This is 
due to the different sensing objective (identifying any arbitrary 
number of holes as opposed to ll2D . 11221 . Il23l that aims to 
identify almost all) as well as the underlying statistical model. 

We show that the gains yielded by this adaptive procedure 
can be interpreted in two ways. First we demonstrate that 
when targeting the same level of reliability in hole detection, 
the adaptive procedure requires substantially less sampling 



budget, or equivalently it is substantially more agile. Secondly, 
we show that under the same sampling budget, and targeting 
identical hole detection reliability, the adaptive procedure 
imposes less-stringent conditions on how fast the power of the 
active users 7„ must scale with increasing n. This essentially 
indicates that for some choices of j n the adaptive procedure 
can guarantee successful hole detection while the best non- 
adaptive procedure fails to do so. 

Let us define K as the number of exploration cycles 
(iterations) in the exploration phase. Also denote the sampling 
budget per channel in the k th exploration cycle by M%. The 
exploration phase is initialized by including all channels for 
sensing and resumes as follows. In the first iteration all chan- 
nels are allocated the identical sampling budget of My. The 
energy levels of all channels are compared against Ai(l+7„), 
where Ai is the median of the distribution Gamma (My, 1). 
The channels for which the measured energy exceed this 
threshold are discarded and the rest are carried over to the 
second iteration for further sensing. The same procedure is 
repeated throughout all K cycles. More specifically, in the 
k th cycle all the channels retained by the (k — l) th iteration 
are allocated the identical sampling budget of M k . The energy 
levels of these channels are compared with Afc(l +7 n ), where 
Afc is the median of the distribution Gamma (M&, 1) and 
the exploration is performed via thresholding as in the first 
iteration. Finally, after the exploration phase, each of the 
remaining channels is allocated the sampling budget of Mk+i 
and the robust spectrum detection scheme provided in Remark 
|2] is applied in order to detect T holes. 

We set Go = {1, . . . , n} and for k — 1,...,K, we define 
Gk as the set of the indices of the channels that are retained by 
the k th exploration cycle. Clearly we have Gk C . . . , C Qy C 
Go and Gk contains the set of the indices of the candidate 
channels among which T holes will be detected. The set of 
measurements defined for the non-adaptive scheme in is 
extended for the proposed adaptive procedure as follows. We 
define the set of measurements as 

X>*^ {x\ : X\ = y/pi Zi H^oS^+W^ for i £ Gk-i] , 

for k = 1, . . . , K + 1. The measurement sets X>*, . . . , D„ are 
processed in the exploration phase and the measurement set 
T>n +1 is used in the detection phase. Based on the model of 
channels dynamics described in Section III- AI only the spectral 
occupancy {Zi} and the power of active users {pi} remain 
unchanged during all observations, and fading, channel noise 
and transmitted signal change to independent states after each 
observation. Therefore, given the occupancy status Zi, the 
observation sample X^(j) for k = 1, . . . , K + 1 is distributed 
as 

x i(j) I z % ~ -A/c(0, 1+piZi) , (18) 

for i £ Gh-x an d j = 1) • • • > M. We also define 

Y t k = \\X k t \\ 2 for i£G k -i and k = l,...,K + l. 

(19) 

Equations ( IT8l and (T% provide that for a given Zi, is 
distributed as 

if | Zi ~ Gamma (M k , 1 + Vl Z t ) , (20) 



s 



for i £ Gk-i and k — 1, . . . , K+l. For each k = 1, . . . , K+l, 
corresponding to the sequence {Y i k }i < zg k l we define the 
sequence of order statistics \Xu\}ieQh-i m an increasing 
order such that Y"A represents the i th smallest element of 
this sequence. The adaptive sensing procedure is formally 
described in Table 1. 



B. Asymptotic Performance 

We start by assessing the performance for any given value 
of the exploration cycles K and relegate the discussions on 
the optimal design of K to the next section. The analysis of 
the adaptive sensing procedure follows the approach of [21], 
[22 1, albeit with the non-trivial modifications to deal with the 
different objective and the different observation model. The 
following lemmas shed light on how the adaptive procedure 
accomplishes the exploration cycles. Lemma [2] characterizes 
the proportion of the spectrum holes that are retained in each 
exploration cycle. 

Lemma 2: Let mo = \Hq\ and for k = 1, . . . , K define m k 
as the number of holes retained by the k th exploration cycle. 
Conditionally on rrik-i for k = 1, . . . , K and for sufficiently 
large n the event 



7* 



In 



m.k-1 <m k < m k -i 



(21) 



holds with probability at least 1— cxp (— m '°~ 1 ) for any a > 0. 

Proof: See Appendix iBl ■ 
The next lemma shows that during each exploration cycle 
almost half of the non-holes are eliminated. 

Lemma 3: Let £q = \Hi\ and for k = 1, . . . , K define £ k 
as the number of the occupied channels retained by the k th 
exploration cycle. Conditionally on £ k -i for k = l,...,K 
and for sufficiently large n the event 



1 



Ck 



logn 



4-1 < 4 < c fe + 



1 



log n 



(22) 



where 



holds with probability at least 1 — 2exp (— ^ n yi 
Ck < \ is a constant. Furthermore, c% = \ if and only if 
Pi = 7„ for i E Hi. 

Proof: See Appendix ICl ■ 
A careful use of the above lemmas establishes the performance 
of the adaptive robust spectrum detection in the following 
theorem. 

Theorem 2 (Adaptive Tradeoff): For the 4 r (7 n ) class of ac- 
tive users, when e„ = o(l) and ne n = ui(l), the error proba- 
bility of the adaptive robust spectrum detection procedure for 
identifying T spectrum holes is given by 



P A (n) 



min P (Q\ C\Hii= 



1 



■(23) 



Proof: See Appendix iDl ■ 
The tradeoff above suggests that the asymptotic performance 
of the adaptive procedure does not impose any constraint on 
the choice of M k for k = 1, . . . , K, other than the trivial 
constraint M k > 1, Hence, for achieving the best asymptotic 



performance the best sequential experimental design requires 
maximizing Mjc+i, i.e., the sampling budget per channel in 
the detection phase. More specifically, given a fixed sampling 
budget, i.e., a "cap" on the maximum number of measurements 
available, the best asymptotic strategy is to allocate as much 
as possible sampling budget for the detection phase and to 
allocate as low as one sample per channel in each exploration 
cycle. This implies that as low as one sample per channel in 
each exploration cycle suffices to ensure that for sufficiently 
large n, the exploration phase retains almost all the holes and 
discards almost half of the occupied channels. Therefore, to 
achieve the best asymptotic behavior we should set M k = 1 
for k = 1,...,K. 

In order to quantify the gains yielded by the adaptive 
algorithm, we compare the results for the non-adaptive and 
adaptive schemes provided in Theorems Q] and [2] In particu- 
lar, with the aim of attaining identical asymptotic reliability 
levels in the adaptive and the non-adaptive procedures, i.e., 
P A (n) = P NA (n), we characterize the agility gain, which we 
define as the ratio of the sampling budgets required by the non- 
adaptive procedure to that required by the adaptive scheme, 
i.e., 

agility gain 

A sampling budget of the non-adaptive procedure 
sampling budget of the adaptive procedure 

Theorem 3 (Agility): For the 1 J r (7 n ) class of active users, 
when e„ = o(l) and ne n = w(l), the agility gain of the adap- 
tive robust spectrum detection algorithm with Mn sampling 
budget is asymptotically lower bounded by (jk+^j) , 
where K is the number of exploration cycles. 

Proof: See Appendix [E] ■ 
It is noteworthy that while the number of exploration cycles 
K can be made arbitrarily large (but fixed as a function of 
n), increasing it beyond some point will affect the agility 
very insignificantly. More specifically, for large K, the agility 
gain lower bound will be dominated by the term 4p This 
underlines the fundamental limit of the agility gain yielded by 
the adaptive procedure. 

An analogue of Corollary Q] can be derived for the adaptive 
procedure, providing a necessary and sufficient condition on 
the scaling of j n for guaranteeing a reliable hole detection with 
the proposed algorithm. For comparison purposes we assume 
that both adaptive and non-adaptive procedures are granted the 
same sampling budget. 

Corollary 2 (Adaptive Power Scaling): For the ^('Jn) 
class of active users, when e„ = o(l) and ne n — u>(l), given 
that the sampling budget is Mn, a necessary and sufficient 
condition for P A (n) — > as n — > oo is that 



In 



1 



2 K e n 



(24) 



where M' > 2 K (M - 2) + 2. 

Proof: See Appendix [F] ■ 
Comparing the result above with that of Corollary Q] shows 
that an adaptive scheme can cope with much weaker active 
users than a non-adaptive scheme. More specifically, by noting 
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Exploration phase 



Input K £ N and {Mi, . . . , M K+1 } where M k £ {1,2, 3, . . . }. 
Initialize the index set Qq 4— {1, . . . , n}. 
for k = 1, . . . , K do 

IX^II 2 for ieS fe _i 
f oo for i ^ 

Obtain C?fc {* €E | Y^ fc < Afc(l + 7n)} where Aj, is the median of Gamma(Affe, 1). 
end for 



Set Y k = 



Detection phase 



7: SetlfM l' X " +1 "' 2 ! OT ^ . 

(_ +oo for i f y K 

8: Identify the indices of a spectrum holes U T A ob = {i £ Q K : Y t K+1 < Y^ 1 }. 

9: Output Uj w 

TABLE I 

ADAPTIVE ROBUST SPECTRUM DETECTION ALGORITHM 



that M' is substantially larger than M, the power scaling 
requirement in the adaptive scenario, which is smaller than 
uj ( M \l~~tj becomes substantially smaller than its counterpart 

in the non-adaptive scenario us ^ A {/?) ' ^ s a resu l t ' there 
are scenarios where non-adaptive schemes fail to successfully 
identify T holes, while the adaptive scheme succeeds. 

C. Optimal Exploration Cycles (K) 

The role of the exploration cycles is to feed the detector 
with a group of channels with a more condensed proportion 
of spectrum holes to occupied channels. According to the 
characterization of P A (n) given in d23l >, changing K has two 
effects on the asymptotic behavior of P A (n). Specifically, 
increasing K results in a decrease in P A (n) through its direct 
impact via the term 2 K . Increasing the exploration cycles K, 
on the other hand, increases the sampling resources during 
exploration and leaves less resources for the detection phase, 
i.e., Mk+i decreases. Following the same arguments as in 
the proof of Theorem [3] it can be readily verified that the 
superposition of the two effects implies that P A (n) will be 
monotonically decreasing with increasing K. The results of 
Theorem [3] and Corollary [2] also substantiate that increasing 
K enforces a less stringent constraint on power scaling and 
enhances the agility. 

An important feature in the analysis of the proposed algo- 
rithm was that the exploration step asymptotically retains all 
the existing holes, while discarding a large number of occupied 
channels. This ensures the task of the detection phase is much 
more effective. As all the results presented so far hint that a 
large value of K yields higher gains it is natural to ask what 
is the "optimal" value of K so that the exploration phase 
asymptotically retains all the holes. This imposes an upper 
bound on possible values of K, and gives rise to the following 
theorem. 

Theorem 4: The optimal growth of the number of cycles 
K so that the exploration phase asymptotically retains all the 




Number of Channels (n) 



Fig. 1. Comparison of the error probabilities P A (n) and P N a(^) for detecting 
T = 2 over a wide range of n. 



holes for consideration in the detection phase is given by 



K* = log log — 



(25) 



Furthermore, the outcome of Theorem 2 remains valid with 
any choice of K not exceeding K* , which is no longer fixed 
as a function of n. 

Proof: See Appendix IGl ■ 

V. Simulation Results 

In this section we provide some simulation results to 
empirically demonstrate the effectiveness of the proposed 
adaptive procedure. In Figure [T] we aim at identifying T = 2 
spectrum holes and compare the reliability of the non-adaptive 
scheme with that of the proposed adaptive procedure with 
K = 1, ... ,4 exploration cycles. Comparisons are provided 
over the range of n = 10 — 1000 channels. The sampling 
budget in both schemes is set as 5n which means that in the 
non-adaptive scheme each channel is measured M — 5 times. 
In the adaptive scheme in each cycle of the exploration phase 
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Fig. 2. Average number of observations taken per channel versus the number 
of channels n. All the curves correspond to the target error probability for 
detecting T = 2 holes is P A (n) = P NA (n) = 10~ 4 . 



each channel is measured once. The sampling resources not 
used in the exploration phase are equally divided among the 
remaining channels retained by the exploration phase. We set 
the channel occupancy probability e n = n~ 2 ' 3 , which clearly 
satisfies the conditions e„ = o(l) and ne n = w(l), and finally 
assume that the power of active users are = j n — n 1 / 5 , 
Vi € Hi- As predicted by the analysis, for large values of 
n the adaptive procedure yields a significant improvement 
over the non-adaptive scheme, e.g., for n = 100 we gain 
two orders of magnitude in error probability after 4 cycles 
of exploration. The improvement attained for small values of 
n is also considerable, e.g., for n — 20 it is one order of 
magnitude. It is noteworthy that the choices of e„ = ri~ 2 / 3 and 
Pi — In — n 1 / 5 have been arbitrary and extensive simulations 
show that the gains are not very sensitive to these choices. 

Next, we investigate the agility gain. Define the normalized 
sampling budget as the aggregate sampling budget divided 
by the number of channels. In Figure |2] we plot this quan- 
tity against the number of channels n, when requiring both 
approaches to have the same error probability for detecting 
T = 2 holes. For the adaptive scheme we consider the 
performance with K — 1, . . . , 5 cycles of exploration. Again 
we consider the choices of e„ = n~ 2 / 3 and Pi = j n = n 1 / 5 . 
It is seen that the agility is improved by increasing K and 
for K — 5 the adaptive procedure requires about 80% less 
sampling budget than the non-adaptive procedure. In other 
words the adaptive procedure is 5 times faster than the non- 
adaptive scheme for detecting a hole with error probability is 

Figure [3] depicts the results for assessing the necessary and 
sufficient conditions on the scaling of the power of the active 
users provided in Corollaries Q~| and [2] for non-adaptive and 
adaptive procedures. This is intended to depict that there exists 
a region where the non-adaptive procedure fails to identify 
spectrum holes, while the adaptive procedure succeeds. We 
assume that e„ = n a ~ 1 for a 6 [0, 1] and j n — rfi for 
(3 > 0. By using Corollaries[T]and|2]we find that the necessary 
and sufficient conditions for successful detection by the non- 



Fig. 3. Detectability regions for M = 5, K = 4, e„ 
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adaptive and adaptive procedures are (3 > ^f- and (3 > ^gr, 
respectively, where M' > 2 K (M — 2) + 2. The dashed lines 



Fig. 4. Average number of observations taken per channel versus sparsity 
of the holes in the adaptive and the sequential method of |7 ] for achieving 
P A (n) = lO" 4 . 



depict these theoretical boundaries and split the space of (a, /3) 
into three separate regions. We also provide the simulation 
results for some pairs of (a, 0) in each of these regions. It is 
expected that in the lower region both procedures fail; in the 
middle region only the adaptive procedure succeeds and in the 
upper region both succeed. The simulation results (except for 
a few points) support the theoretical asymptotic predictions. 
In the simulations we have set M — 5, K = 4, and n = 1000. 
We remark that requiring as low as about 5 sensing samples 
per channel is substantially less than the compressed-sensing- 
based approaches (e.g., iflOl ), that for obtaining a reliable 
estimate of the PSD take samples at a rate equal to at least 
half Nyquist rate, which for a wideband channel is large. 

In Figure|4] we compare the agility of our proposed adaptive 
procedure with that of the quickest search []7| for achieving 
the target error probability P A (n) = 10 -4 . We compare the 
two schemes at different sparsity levels for the distribution of 
the spectrum holes. Specifically, we consider e n — n a ~ 1 for 
a € [0,1] where smaller values of a correspond to sparser 
distribution of the holes. The objective is to identify T = 2, 3 
holes out of n = 1000 channels. The powers for the active 
users are assumed to be 7 = -nP for ft — 1/5, 1/10 and we 
deploy K = 4 cycles of exploration. It is observed that for 
highly sparse distribution of holes (small enough values of a,) 
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Fig. 5. Illustration of the exploration phase in eliminating the non-holes for n = 1000 in three different system settings. 



our proposed adaptive procedure performs substantially better 
and for higher values of a which correspond to the case that 
there exist abundant number of spectrum holes, the quickest 
search method is outperforming. 

Finally, Figure [5] illustrates three snapshots showing the 
efficiency of the exploration phase in eliminating the non-holes 
for three different system realization. We consider n = 1000 
channels and set e n = n~ 2 / 3 , 7„ = n 1 / 5 . Figure [5] plots 
the performance during K — 4 cycles of exploration, where it 
shows that the number of non-holes reduces considerably after 
each exploration cycle. Also for these system realizations we 
have 10 holes, and we have observed that all the holes are 
retained throughout the exploration phase. 



VI. Conclusions 



In this paper we have presented an adaptive sensing method- 
ology for identifying multiple spectral holes in a wideband 
cognitive radio system. By gradually adjusting the measure- 
ment process using information gleaned from the previous 
measurements we are able to significantly improve the prob- 
ability of correctly detecting multiple spectrum holes. This 
dramatic gain is patent both in the theoretical analysis and in 
the simulation results. More importantly, in the cognitive radio 
setting this improvement translates into a significant increase 
in agility, allowing spectrum holes to be identified quickly and 
therefore widening the window of time for opportunistic data 
transmission. Such dramatic improvements are not possible 
without the use of adaptive sampling techniques, as demon- 
strated. 



Appendix A 
Proof of LemmaQ] 



Let F(t;m) = P(Y t < t) denote the CDF of 
the Gamma (M,a m ) random variables {Y^Li- Also let 
Q (l) (w;m) = P{W$ < w) denote the CDF of W$ . 



Therefore, 

Q(i) (w,m) 



1 



E 

k=0 



P(W$ <w) = P(Y {l) < wb m ) 

F(wb m ;m)] k [l - F(wb m ;m)] m k 

F(wb m ;m) 



1- [1-F(wb m ;m)] m 



k=0 



1 - F{wb„ 



(26) 



Now recall that for a Gamma (M, a m ) distribution 



F(y;m) 



T{M)a, 



x 

CCrn 



exp 



x 



dx 



Noting that for < x < y we have exp(— y/a m ) < 
exp(—x/a m ) < 1 we immediately get 



exp(-y/a m ) ( y_ 

r(M + i) \am 



< F(y;m) < 



1 



T(M+1) V" 



y 



r(M+i) 



, after 



By noting that we had defined b m = a r 
some simplifications we find the following bounds above for 

F(wb m \ m) 



w 



hi 



M r(M + i)\ , , w 

■ exp I -w M ■ — < F(wb m ;m) < — 

m \ to / m 



By using the little-o notation as m —> oo we have 
F(wb m ;m) = ^-(1 + o(l)) . Hence, for the term 

[1 — F(wb m ; m)] m as m — > oo we have 



[1 - F(wb m ;m)] 



,M 



m 



-(1 + 0(1)) 



1 h o(m L ) 

m 



exp 



to log 1 



A I 



W 

m 



+ o{m 1 ) 



w 



A I 



exp [- 
exp(— w M ) 



'(1)] 



(27) 
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On the other hand for every fixed k we get 

k 



F(wb m ;m) 
1 - F(wb m ;m) 



k\ 



-3^(1 + 0(1)) 



TO 
"fcT 



fc! 



— (1 + 0(1)) 
TO 



MA- 



TH 

yield 



fc! 



(28) 



Equations 

Q(i){w;m) = 1 - cxp(-w M )^2 
which is the desired result 



1 W KM 



k=0 
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Appendix B 
Proof of Lemma|2] 

For the non-adaptive procedure by invoking the assumption 
e n — o(l), we know that j n = oj(1) from Corollary Q] 
Throughout the analysis for the adaptive procedure we also 
assume that 7„ = uj(1) and characterize the precise necessary 
and sufficient condition for the scaling rate of j n at the end 
of Section JV] We start with the following remarks. 

Remark 3: Let X ~ Gamma (M,a). Then for any A > 
we have P(X < A) > 1 — exp (-£). 

Remark 4: For B distributed as Binomial(TO, a) and b < 
E[B] we have 

m—b l 
'ma\» 

~) ' 



P(-B < 6) < 



Let us define Jq = Qq n 7^0 an d f° r fc = 1, • • • , if let J7fe = 
SfcflKo be the set containing the indices of the spectrum holes 
retained by the fc t/l exploration cycle. By the definition of mk 
we have |j7fc| = mfe. Also, for any i e Cffc-i define 

7; fc = i{r, fe <A fe (i + 7 „)} , 



where 

r / 1 if A is true 

1{ - 4} = ( if ^isfalse ' f ° r fc = 1 -"»^- 

According to the algorithm in Table 1 for the i th channel, 
which can be either a hole or a non- hole, Tf is 1 if is retained 
by the k th exploration cycle and is otherwise. Therefore, 
the number of holes retained by the k th exploration cycle is 

m k = Y,iej k -! T i- B y notin g th at \ Jk-i\ = mk-i we find 
that 

mk ~ Binomial(mfc_i, ak), for fc = 1, . . . , K , 
where 

a k = P(T t k = 1 |ie Jk-i) 

= P{Y l k < A fc (i + 7„) |ie < l . 

Based on d20l >. is distributed as Gamma (Mfe, 1) for i G 
Jk-x- Therefore, by using Remark |3]we find that 



afe > 1 - exp(-Afe(l + 7n)) 



(29) 



1 + 7-n 



P TOfe < 



7n 



l+7« 
m k -i 



m k -i 



> 1 — cxp ( — 

which concludes the result. 

For sufficiently large n (and thereof sufficiently large 7 n ) 
we have , > exp(— Afc(l+7 n )) which in conjunction with 



l+7r. 



provides that for sufficiently large n 



In 



1 + 7 n 



Therefore, by setting 6 = 1 ^ ■mk-i, for the random variable 
TOfe ~ Binomial(TOfc_i, ak) we have b < E[mfe] = a k m k -\. 
As a result, for this choice of 6 the condition of Remark |4] is 
satisfied and, for sufficiently large n we find that P(m k < b) 



P(m k <b) = P[m k < 



l + 7n 



• mk-i 



< 



< 



((l-o fc )(l + 7n ))' 



((l + 7n)exp(-A fe (l+7„))) " 



1+7»V 

7n 



cxp ( — — ( log(l + 7„) - A fe (l + 7„) + 7„ log ( 1 + — 

V 1 + In \ \ In 



<1 



< 



/ ( log(l + 7 ») . 1 
exp I — — A fe + 



< exp ( 



\ 1 + In 

mk-V 



l + 7n 



(30) 
(31) 



(32) 
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is lower bounded according to (l32l i. Note that (l30l holds 
according to Remark |4] (|3H is obtained by invoking the 
inequality given in d29l i, and noting that at < 1 as a k is a 
probability. Finally the inequality in (f32l > is justified by noting 
that for sufficiently large n and any a > 



log(l + 7 „) 



1 



< 



1 



1 + In 1 + In fl" 

as all terms in the LHS and RHS except the constant — \ k 
are diminishing in the asymptote of large n. Recall that J k C 
J7fc-i and consequently, mt < m k -i. This fact in conjunction 
with d32l > provides that 

Appendix C 
Proof of Lemma[3] 

Let Co = Go n "Hi and for k = 1,...,K define C k = 
Gk H "Hi which contains the set of the indices of the non- 
holes contained retained by the k th exploration cycle. By the 
definition of 4 we have and \£ k \ = 4- By recalling the def- 
inition of T* and taking into account their independence, the 
number of non-holes that the k th exploration cycle retains is 
4 = J2iec k i -^i ■ Applying Hoeffding's inequality provides 

4-1 \ ^ r, ( 21 fc_ 



p(|4-E[4] 

On the other hand 



> 



bgn 



< 2 exp 



E[4]= £ E[2*] 



(bgn) 2 

= 1 h'e£ fe _i) 



(33) 



(34) 

According to (1201 1. for i g 4--i, is distributed as 
Gamma (M k , 1 For any i £ Ct-i we denote the CDF 

of Gamma(Mfc, 1 + Pi) by F*(t;n). Since > j n we get 

P(2? = 1 |t e A-i) = ^ fe (A fe (i + 7 n)) 
<F?(\ k (l+ Pi )) 



1 



r(M k )(i+ Pl ) M 



Afc(l+Pi) 



„M-1 



exp 



l+Pi 



dx 



1 



„M-1 



exp (— x) dx 



r(A/ fc ) 
l 

~~ 2' 

where the last step holds as we had defined X k as the median of 
Gamma(M fc , 1). As a result, from d34ji we find that E[4] < 
= %i. By selecting c fc = |M < i an d recalling d33j 

we get the desired result. Finally note that F^^\ k {l + 7„)^ 
is monotonic in pi. Therefore, the necessary and sufficient 
condition for having E[4] = %i is that F t k (\ k (l + j n )) = 
F i fc (Afc(l+pi)), which in turn requires that pi = j n for i € "Hi. 



Appendix D 
Proof of Theorem[2] 

Similar to the proof of Theorem [TJ for quantifying the 
performance of the robust detection scheme, we consider the 
worst-case scenario and set pi = j n for all i <E Hi- Define the 
event A k as in ( T37] ). 

Lemma [2] and [3] characterize the probability of event A k 
given rrik-i and 4-i> and state that, 

(k = P(A k | m fc _ 1( 4-i) 



> P m fc _i >m k > ( 1 - 



+ P 



1 + >, 



(7 J - r^)4-i < 4 < (\ + I 4-i 

\V2 bgn/ \2 bgn/ 



. 1 / m-fc-i , „ , 24_i 

> 1 - cxp( ) - 2exp(-- — ) 

n Q y v (bgn) 2 ' 



(35) 



By concatenating all the bounds we conclude that the event 
(j4i, A2, . . . i Ak) is a sufficient condition for the event A 
defined in ([38]). 

By noting that given m k -\ and 4-i> the event A k is 
independent of events Ai for i < k, we immediately conclude 
that the event A holds with probability at least 



P(A)>P(A U A 2 ,...,A K ) 



K 



K 



= ]JP(A k I ro fc _i,i fc _i) = JJCi • 

i=l i=l 

In the asymptote of large n, for any fixed K we have 
(conditions on the choice of K are discussed in Theorem 



lim 



('"IT 



+ 7n 



A" 



= 1 



and lim I = 1 

n-i-oo V 2 log nJ 



and lim 



1 



1 \ A " 



1 



?i->oo \2 log 71/ 

which provide that in the asymptote of large n 



(36) 



A = yn K = m and l K = ^ 

with probability at least YiiLi Ci- On the other hand, by recall- 
ing the assumptions e n = o(l) and ne n = w(l), according to 
the the law of large numbers we get to n(l — e n ) = n 
and too = ne n = Hence, we can ensure that for 

k = 1,...,K, 4 — ^ 1 as n ~ * 00 ■ More specifically, by 
selecting a < lim,^^ '"g^"" for (i we have 

,. wo ne n 

lim = lim = +00 , 

n— too n a n— >oo fl a 



A k = { m k -x >m k >(l- — - — )m fe _i and 

1 + In 



1 1 

2 ~ be 



r ^)4-i < 4 < (J + r^)4-i| 

log n/ \2 bgn/ J 



(37) 



1 \ K /l 1 \ K /l 1 \ K 

A = I to > m K > [ 1 - too and ( - - — — ■ — ) l < i K < { - + — — ■ — ) l 



1 + In 



2 bgn 



2 bgn 



(38) 
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which shows that 



hm cxp( -J = 

n— >oo XI 



(39) 



and 



lim 



2io 



■ ~ (logn) 2 
which shows that 



2n 

I™ j, ^ 

n-Hxs (log n) z 



lim 2 cxp I — fc * 1=0 
' (logn) 2 



(40) 



Equations (f39t-d40b and (f35T > yield that lim„^oo Ci = 1. By 
following the same procedure, the same result for all can be 
recovered. As a result, for sufficiently large n, with probability 
1 we have rtiK = mo an d = £o/2 K - 

Now let us define no and hi as the number of holes and 
non-holes retained after the exploration phase. By following 
the same line of argument as in the proof of Theorem Q] and 
by appropriately applying Lemma Q] we can show that 



P A (n) = 1 - f 1 + [(1 + ln) M ■ ^n}- 1 ) ■ (41) 

Note that for sufficiently large n 

n m K m K _ n K K 

m Ik t m 

Plugging this asymptotic value of into ( HTt completes the 
proof. 



— T 



Appendix E 
Proof of Theorem[3] 

Based on the structure of the experimental design, for 
a given Mn sampling budget the non-adaptive procedure 
takes M observations per channel. By equating the error 
probabilities of hole detection in the non-adaptive and adaptive 
schemes given in (flZb and d23l we find that as n — > oo 



(1 



ln) M e n 



{l+ln) MK+1 -2 



which shows that 



M 



A'+l 



M 



K 



log(l + 7„) 



< M 



(42) 



Next, let us evaluate the sampling budget required by the non- 
adaptive procedure given that in the detection phase we take 
Mk+i samples per channel that has survived the exploration 
phase. As discussed earlier, the characterization of P A (n) 
conveys that the optimal experimental design is to take 1 
sample per channel in each exploration cycle. Therefore, the 
aggregate amount of samples taken throughout exploration and 
detection phases is Sfc=o( mfc + ^ n otner hand, from 
lemmas [2] and [3] we find that in the asymptote of large n 
we have = mo and £k = -k%i\ with probability 1 for 



1,...,K. Recalling that 



and £ 



we find that the sampling budget required by the adaptive 
procedure is asymptotically equal to 
K-l K-l 

k=0 k=0 



n 2 



1 



< n 2 



Mk+2 

2 k-i ^ 2 K 
M K+ x 

OK 



(43) 



Equations d42l and d43l ) together show that the sampling bud- 
get of the adaptive procedure is asymptotically upper bounded 
by n(2+M/2 K ). As a result, for maintaining P A (n) — P NA (n), 
the asymptotic ratio of the sampling budget required by the 
non-adaptive procedure to that required by the adaptive scheme 
is lower bounded by 

-l 



Mn 



n(2 + M/2 K ) 



2 1 



Appendix F 
Proof of Corollary[2] 

First we find how Mk and M should be related such that 
they give rise to the same sampling budget required by the 
adaptive and non-adaptive schemes. By following a similar 
line of argument as in the proof of Theorem [3] and (l4Jt we 
find that 

K 



Mn = J2( m k +4) < n 



k=0 



2* - 1 M K +i \ 
2 k-i 2 K }' 



which provides M K+1 > 2 K (M - 2) + 2. On the other hand, 
from (l23l forcing P A (n) — > yields that it is necessary and 
sufficient to have 



In = W 



1 



2 K e 



Setting M' = Mk+i establishes the desired result. 

Appendix G 
Proof of TheoremO 

A key constraint that guarantees retaining almost all the 
holes throughout the exploration phase is cast in ((36) as 

/ 1 \ K 
lim (1 ] = 1 . 



(44) 

n—toc \ 1 + 7n ' 

According to Corollary |2] guaranteeing this equality necessi- 
tates that K does not grow faster than 7„, i.e., 



Keo(j n ) while j n = oj 



1 



2 K e 



(45) 



Define 

Q n = {j n | ^ n does not satisfy d24li} . (46) 
From (l45T>-(l46b we obtain that 

1 



K = sup 7„ 



■Zk— ^'log^ + 2 = log- 



(47) 
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By taking into account the characterization of M' in Corol- 
lary 12 we find the following upper bound on K. 



(2 K (M - 2) + 2) log K + 2 < log — 



(48) 



Next, we identify the scaling law of K with respect to n such 
that the asymptotic inequality above is satisfied. As K scales, 
the constant summands (i.e., +2) will be dominated by other 
summands, the inequality above can be stated as 



log(A/ - 2) + K + log log K < log log 



1 



(49) 



(M — 2) being a constant as well as log log K are both 
dominated by K, giving rise to the following upper bound 
on K 

K < log log — . (50) 

We also need to examine the constraints on K that guarantee 
that the exploration phase discards the non-holes as expected 
in the proof of Theorem [2] More specifically, we need to 
ensure that the two following equalities hold. 



lim 



i \ K /i i \ K 

= 1 and lira ( — + ] = 1 , 



log n 

which indicate that K must satisfy 

K e o(log n 



2 loe 



(51) 
(52) 



By recalling the assumption ne„ = w(l), or its equivalent form 
— = o(ri), it can be readily verified that the bound given in 
( f50b is stronger than that in (|52V By taking into account the 
monotonicity of the detection performance with increasing K, 
the optimal choice of K is the upper bound in (l50l . 

Appendix H 
Proof of RemarkQ] 

The probability that a fixed set of channels U with cardi- 
nality T contains only holes is 

P{U C Ho | £>„) = Y[P(i£H \ V n ) 



ieu 



n 

ieu 



P{i 6 Hp) ■ P{V n \ieH ) 
P{V n ) 

P(ieH )-P(X t \ieHo) 



n 

ieu 

n 

ieu 





;-ii^ii 2 ; 




exp ( ||X 4 || 2 ) + ^(J+p^M exp 





1 - e,. 



(l+ Pl ) MeXP \l+ Pl 



\Xi 



(53) 



where the above expansion follows from Bayes's rule and 
the statistical independence of the observations made over 
different channels. Hence, the MAP detector that maximizes 



P(U C Ho | T> n ) among all sets of channels with cardinality 
T is given by 



where 



arg max logPlU C Ho I T> n ) 

U: \U\=T 



arg min y log 

ieu 



1 + 



Ui 



1 ( Pi 

(l+ Pi )M exp {l+P 



l^ll 2 



(54) 



(55) 



Therefore, U„ k AP contains the indices of the T smallest elements 
of the set {U\, . . . , U n }. 
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